An Information Nutritional Label for Online Documents
نویسندگان
چکیده
منابع مشابه
Information Extraction from Online XML-encoded Documents
Online reference documents tend to be semi-formatted in that they contain repeated sections with similar structure, and have free-text inside each section. XML (extensible markup language) enables document designers to design rich tag sets where tags for section headings contain information about each section. This contextual information, coupled with the fact that the free-text sections of the...
متن کاملMachine Learning for Information Extraction from Online Documents
The eld of information extraction (IE) is concerned with applying natural language processing (NLP) to extract essential details from text documents automatically. Recent results have demonstrated the viability of this idea for collections of journalistic prose in narrow domains. The idea's appeal and applicability, however, is much broader than work to date seems to imply. In particular, the o...
متن کاملA Nutritional Label for Rankings
Algorithmic decisions often result in scoring and ranking individuals to determine credit worthiness, qualifications for college admissions and employment, and compatibility as dating partners. While automatic and seemingly objective, ranking algorithms can discriminate against individuals and protected groups, and exhibit low diversity. Furthermore, ranked results are often unstable — small ch...
متن کاملAn Online System for Automatic Annotation of Audio Documents
This article presents a system for automatic transcription of audio documents. The system includes online implementations of recent algorithms for audio segmentation, speech/nonspeech classification, and speaker clustering, and integrates them with large vocabulary speech recognition systems for both English and French. We also propose a segment-based speech confidence score, and demonstrate th...
متن کاملGeographic Information Retrieval and Visualization of Online Unstructured Documents
Newspapers, travel narratives, blogs, books and the Internet hold a huge amount of geographic information that can be extracted in order to provide visual exploration. Also, the understanding of place references involves knowledge of the document context. In this way, the study of tools for disambiguation is needed. For the automatic annotation of time and location, both shared world knowledge ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ACM SIGIR Forum
سال: 2018
ISSN: 0163-5840
DOI: 10.1145/3190580.3190588